Picture for Chhavi Yadav

Chhavi Yadav

Open-Weight LLM Fine-Tuning Defenses are Susceptible to Simple Attacks

Add code
May 26, 2026
Viaarxiv icon

Curriculum Learning for Safety Alignment

Add code
May 25, 2026
Viaarxiv icon

Research in Collaborative Learning Does Not Serve Cross-Silo Federated Learning in Practice

Add code
Oct 14, 2025
Viaarxiv icon

Can We Infer Confidential Properties of Training Data from LLMs?

Add code
Jun 12, 2025
Viaarxiv icon

ExpProof : Operationalizing Explanations for Confidential Models with ZKPs

Add code
Feb 06, 2025
Figure 1 for ExpProof : Operationalizing Explanations for Confidential Models with ZKPs
Figure 2 for ExpProof : Operationalizing Explanations for Confidential Models with ZKPs
Figure 3 for ExpProof : Operationalizing Explanations for Confidential Models with ZKPs
Figure 4 for ExpProof : Operationalizing Explanations for Confidential Models with ZKPs
Viaarxiv icon

Evaluating Deep Unlearning in Large Language Models

Add code
Oct 19, 2024
Figure 1 for Evaluating Deep Unlearning in Large Language Models
Figure 2 for Evaluating Deep Unlearning in Large Language Models
Figure 3 for Evaluating Deep Unlearning in Large Language Models
Figure 4 for Evaluating Deep Unlearning in Large Language Models
Viaarxiv icon

Influence-based Attributions can be Manipulated

Add code
Sep 10, 2024
Viaarxiv icon

FairProof : Confidential and Certifiable Fairness for Neural Networks

Add code
Feb 19, 2024
Figure 1 for FairProof : Confidential and Certifiable Fairness for Neural Networks
Figure 2 for FairProof : Confidential and Certifiable Fairness for Neural Networks
Figure 3 for FairProof : Confidential and Certifiable Fairness for Neural Networks
Figure 4 for FairProof : Confidential and Certifiable Fairness for Neural Networks
Viaarxiv icon

Keeping Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models

Add code
May 22, 2023
Figure 1 for Keeping Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models
Figure 2 for Keeping Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models
Figure 3 for Keeping Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models
Figure 4 for Keeping Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models
Viaarxiv icon

A Learning-Theoretic Framework for Certified Auditing of Machine Learning Models

Add code
Jun 09, 2022
Figure 1 for A Learning-Theoretic Framework for Certified Auditing of Machine Learning Models
Figure 2 for A Learning-Theoretic Framework for Certified Auditing of Machine Learning Models
Figure 3 for A Learning-Theoretic Framework for Certified Auditing of Machine Learning Models
Figure 4 for A Learning-Theoretic Framework for Certified Auditing of Machine Learning Models
Viaarxiv icon